GenCore version 5.1.3
Copyright (c) 1993 - 2003 Compugen Ltd.

OM nucleic - nucleic search, using sw model

Run on:         January 11, 2003, 10:44:48 ; Search time 2673 Seconds
                (without alignments)
                9015.011 Million cell updates/sec

Title:          US-09-602-740-1
Perfect score:  828
Sequence:       1 gaggagcttcgccacatgga......gcagcgccagctctaacaag 828

Scoring table:  IDENTITY_NUC
                Gapop 10.0 , Gapext 1.0

Searched:       2054640 seqs, 14551402878 residues

Total number of hits satisfying chosen parameters:      4109280

Minimum DB seq length: 0
Maximum DB seq length: 2000000000

Post-processing: Minimum Match 0%
                 Maximum Match 100%
                 Listing first 45 summaries



Database :      GenEmbl:*
                1:  gb_ba:*
                2:  gb_htg:*
                3:  gb_in:*
                4:  gb_om:*
                5:  gb_ov:*
                6:  gb_pat:*
                7:  gb_ph:*
                8:  gb_pl:*
                9:  gb_pr:*
                10:  gb_ro:*
                11:  gb_sts:*
                12:  gb_sy:*
                13:  gb_un:*
                14:  gb_vi:*
                15:  em_ba:*
                16:  em_fun:*
                17:  em_hum:*
                18:  em_in:*
                19:  em_mu:*
                20:  em_om:*
                21:  em_or:*
                22:  em_ov:*
                23:  em_pat:*
                24:  em_ph:*
                25:  em_pl:*
                26:  em_ro:*
                27:  em_sts:*
                28:  em_un:*
                29:  em_vi:*
                30:  em_htg_hum:*
                31:  em_htg_inv:*
                32:  em_htg_other:*
                33:  em_htg_mus:*
                34:  em_htg_pln:*
                35:  em_htg_rod:*
                36:  em_htg_mam:*
                37:  em_htg_vrt:*
                38:  em_sy:*
                39:  em_htgo_hum:*
                40:  em_htgo_mus:*
                41:  em_htgo_other:*



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 
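
The search header names a Smith-Waterman ("sw") model with the IDENTITY_NUC scoring table and gap penalties Gapop 10.0 and Gapext 1.0. As a rough illustration of what such a local-alignment score is, the following is a minimal Python sketch of Smith-Waterman with affine gaps (Gotoh's recurrences). It is not GenCore's implementation: the match and mismatch values, the gap-cost convention (a gap of length k costs gap_open + gap_ext * (k - 1)) and the function name are assumptions made for this example. Only the two gap parameters and the perfect score of 828 for the 828-base query come from the report.

```python
def smith_waterman_affine(query, target, match=1.0, mismatch=0.0,
                          gap_open=10.0, gap_ext=1.0):
    """Return the best local alignment score with affine gap penalties.

    Assumed for illustration, not taken from the report: the match and
    mismatch values, and a gap of length k costing
    gap_open + gap_ext * (k - 1).
    """
    neg_inf = float("-inf")
    n = len(target)
    h_prev = [0.0] * (n + 1)      # H[i-1][j]: best score ending at that cell
    f_prev = [neg_inf] * (n + 1)  # F[i-1][j]: best score ending in a gap that skips query bases
    best = 0.0
    for i in range(1, len(query) + 1):
        h_cur = [0.0] * (n + 1)
        f_cur = [neg_inf] * (n + 1)
        e = neg_inf               # E[i][j-1]: best score ending in a gap that skips target bases
        for j in range(1, n + 1):
            # Open a new gap from H, or extend an existing one.
            e = max(h_cur[j - 1] - gap_open, e - gap_ext)
            f_cur[j] = max(h_prev[j] - gap_open, f_prev[j] - gap_ext)
            s = match if query[i - 1] == target[j - 1] else mismatch
            # Local alignment: a cell never drops below zero.
            h_cur[j] = max(0.0, h_prev[j - 1] + s, e, f_cur[j])
            best = max(best, h_cur[j])
        h_prev, f_prev = h_cur, f_cur
    return best


# A sequence aligned to itself scores its own length under match = 1,
# which is how an 828-base query would reach a perfect score of 828.
print(smith_waterman_affine("gaggagcttcgccacatgga", "gaggagcttcgccacatgga"))
```

The example keeps only two rows of the dynamic-programming matrix, so memory grows with the target length rather than with the product of both lengths.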

SUMMARIES

                   %
 Result          Query
    No.   Score  Match   Length  DB  ID                Description
----------------------------------------------------------------------------
      1     828  100.0      828   6  AX064875          AX064875 Sequence
      2     828  100.0     6995   6  AX074270          AX074270 Sequence
      3     828  100.0     6995   6  AX076272          AX076272 Sequence
      4     828  100.0   338150   1  AP005278          AP005278 Corynebac
      5     828  100.0   349980   6  AX127148          AX127148 Sequence
      6     705   85.1      705   6  AX121829          AX121829 Sequence
      7     480   58.0     3038   6  AX074275          AX074275 Sequence
c     8   149.6   18.1    15424   1  AE007019          AE007019 Mycobacte
c     9   149.6   18.1    40790   1  MTCY493           Z95844 Mycobacteri
     10   134.8   16.3    35881   1  U00013            U00013 Mycobacteri
c    11   134.8   16.3   312050   1  MLEPRTN3          AL583919 Mycobacte
     12     104   12.6     1080   6  AX066175          AX066175 Sequence
     13      84   10.1      960   6  AX074273          AX074273 Sequence
     14      84   10.1      960   6  AX074278          AX074278 Sequence
     15      81    9.8      957   6  AX121828          AX121828 Sequence
c    16    73.2    8.8      331   6  AR128446          AR128446 Sequence
     17    72.2    8.7    22115   1  SCC22             AL096839 Streptomy
     18    61.2    7.4   137478   2  AP004704          AP004704 Oryza sat
     19    60.8    7.3     1024   8  AY084434          AY084434 Arabidops
c    20    60.8    7.3    80367   8  AC012329          AC012329 Arabidops
c    21    60.8    7.3   129757   8  ATF2K15           AL132956 Arabidops
     22    58.6    7.1     1250   8  AY086161          AY086161 Arabidops
c    23    58.6    7.1    45529   8  AB016884          AB016884 Arabidops
     24    57.4    6.9   154029   2  AC120538          AC120538 Oryza sat
c    25    56.8    6.9   148142   1  D90916            D90916 Synechocyst
     26    55.4    6.7    10548   1  AE004417          AE004417 Vibrio ch
c    27    55.4    6.7    21149   2  AC013910          AC013910 Drosophil
     28    55.4    6.7    99701   2  AC125292          AC125292 Drosophil
     29    55.4    6.7   179312   3  AC023745          AC023745 Drosophil
     30    55.4    6.7   314087   3  AE003484          AE003484 Drosophil
     31    55.4    6.7   334028   2  AC116537          AC116537 Drosophil
c    32    54.6    6.6    10195   1  U32737            U32737 Haemophilus
c    33    54.2    6.5   132685   2  AP005175          AP005175 Oryza sat
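
In the rows above, the Query Match column agrees with the Score divided by the Perfect score (828), expressed as a percentage and rounded to one decimal place. The report does not state this formula; the short Python sketch below only checks that reading against a few rows, and the function name is made up for the example.

```python
PERFECT_SCORE = 828  # "Perfect score" from the search header


def query_match_percent(score, perfect_score=PERFECT_SCORE):
    """Score as a percentage of the perfect score, one decimal place."""
    return round(100.0 * score / perfect_score, 1)


# (score, Query Match as printed in the summaries) for results 6, 7 and 8
for score, printed in [(705, 85.1), (480, 58.0), (149.6, 18.1)]:
    print(score, query_match_percent(score), printed)
```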



GenCore version 5.1.3
Copyright (c) 1993 - 2003 Compugen Ltd.

OM nucleic - nucleic search, using sw model

Run on:         January 11, 2003, 09:05:53 ; Search time 229 Seconds
                (without alignments)
                8142.597 Million cell updates/sec

Title:          US-09-602-740-1
Perfect score:  828
Sequence:       1 gaggagcttcgccacatgga......gcagcgccagctctaacaag 828

Scoring table:  IDENTITY_NUC
                Gapop 10.0 , Gapext 1.0

Searched:       2185239 seqs, 1125999159 residues

Total number of hits satisfying chosen parameters:      4370478

Minimum DB seq length: 0
Maximum DB seq length: 2000000000

Post-processing: Minimum Match 0%
                 Maximum Match 100%
                 Listing first 45 summaries



Database :      N_Geneseq_101002:*
                1:  /SIDS2/gcgdata/geneseq/geneseqn-embl/NA1980.DAT:*
                2:  /SIDS2/gcgdata/geneseq/geneseqn-embl/NA1981.DAT:*
                3:  /SIDS2/gcgdata/geneseq/geneseqn-embl/NA1982.DAT:*
                4:  /SIDS2/gcgdata/geneseq/geneseqn-embl/NA1983.DAT:*
                5:  /SIDS2/gcgdata/geneseq/geneseqn-embl/NA1984.DAT:*
                6:  /SIDS2/gcgdata/geneseq/geneseqn-embl/NA1985.DAT:*
                7:  /SIDS2/gcgdata/geneseq/geneseqn-embl/NA1986.DAT:*
                8:  /SIDS2/gcgdata/geneseq/geneseqn-embl/NA1987.DAT:*
                9:  /SIDS2/gcgdata/geneseq/geneseqn-embl/NA1988.DAT:*
                10:  /SIDS2/gcgdata/geneseq/geneseqn-embl/NA1989.DAT:*
                11:  /SIDS2/gcgdata/geneseq/geneseqn-embl/NA1990.DAT:*
                12:  /SIDS2/gcgdata/geneseq/geneseqn-embl/NA1991.DAT:*
                13:  /SIDS2/gcgdata/geneseq/geneseqn-embl/NA1992.DAT:*
                14:  /SIDS2/gcgdata/geneseq/geneseqn-embl/NA1993.DAT:*
                15:  /SIDS2/gcgdata/geneseq/geneseqn-embl/NA1994.DAT:*
                16:  /SIDS2/gcgdata/geneseq/geneseqn-embl/NA1995.DAT:*
                17:  /SIDS2/gcgdata/geneseq/geneseqn-embl/NA1996.DAT:*
                18:  /SIDS2/gcgdata/geneseq/geneseqn-embl/NA1997.DAT:*
                19:  /SIDS2/gcgdata/geneseq/geneseqn-embl/NA1998.DAT:*
                20:  /SIDS2/gcgdata/geneseq/geneseqn-embl/NA1999.DAT:*
                21:  /SIDS2/gcgdata/geneseq/geneseqn-embl/NA2000.DAT:*
                22:  /SIDS2/gcgdata/geneseq/geneseqn-embl/NA2001A.DAT:*
                23:  /SIDS2/gcgdata/geneseq/geneseqn-embl/NA2001B.DAT:*
                24:  /SIDS2/gcgdata/geneseq/geneseqn-embl/NA2002.DAT:*



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES

                   %
 Result          Query
    No.   Score  Match   Length  DB  ID                Description
----------------------------------------------------------------------------
      1     828  100.0      828  22  AAF71360          Corynebacterium gl
      2     828  100.0     6995  22  AAF25332          Nucleotide sequenc
      3     828  100.0     6995  22  AAF29332          C. glutamicum OpcA
      4     828  100.0   349980  22  AAH68529          C glutamicum codin
      5     705   85.1      705  22  AAH66710          C glutamicum codin
      6     480   58.0     3038  22  AAF29319          C. glutamicum OpcA
c     7   149.6   18.1  4411529  22  AAI99682          Mycobacterium tube
      8     104   12.6     1080  22  AAF71023          C. glutamicum SRT
      9      84   10.1      960  22  AAF29320          C. glutamicum OpcA
     10      84   10.1      960  22  AAF29333          C. glutamicum OpcA
     11      81    9.8      957  22  AAH66709          C glutamicum codin
c    12    73.2    8.8      331  21  ABQ62809          Mycobacterium tube
     13    63.4    7.7    53178  23  AAS59543          Propionibacterium
     14    60.8    7.3     1022  21  AAC51469          Arabidopsis thalia
     15    60.8    7.3     1024  21  AAC45134          Arabidopsis thalia
     16    58.6    7.1     1250  21  AAC34656          Arabidopsis thalia
     17    58.6    7.1     1256  21  AAC47798          Arabidopsis thalia
     18    58.6    7.1     1310  21  AAC49386          Arabidopsis thalia
     19    55.4    6.7      732  23  ABL18875          Drosophila melanog
     20    55.4    6.7     2982  23  ABL18874          Drosophila melanog
     21    55.4    6.7     4878  23  ABL04246          Drosophila melanog
     22    54.2    6.5      414  21  AAC98274          Human colon cancer
c    23    53.8    6.5     1010  22  AAI60629          Human polynucleoti
     24    53.8    6.5     1015  22  AAI58843          Human polynucleoti
     25    53.8    6.5     1018  21  AAF21892          Human breast and o
     26    48.2    5.8     6132  21  AAC79584          Virulence gene #4.
     27    47.8    5.8      915  21  AAC41761          Arabidopsis thalia
     28    44.6    5.4     1076  21  AAC49387          Arabidopsis thalia
     29    44.6    5.4     1078  21  AAC35307          Arabidopsis thalia
     30      42    5.1     4966  22  AAS36385          Human cardiovascul
c    31    41.2    5.0     7002  23  ABL09667          Drosophila melanog
c    32    41.2    5.0    10637  23  ABL09666          Drosophila melanog
c    33      41    5.0     6558  21  AAZ45602          cDNA sequence of a
c    34    40.6    4.9      592  21  AAZ93361          Sequence encoding
c    35    40.6    4.9      666  22  AAC84604          Human ZF4 protein
c    36    40.2    4.9      381  23  ABL41662          Nucleotide sequenc
c    37    39.6    4.8     3584  23  ABL07253          Drosophila melanog
c    38    39.6    4.8    13531  23  ABL07252          Drosophila melanog
     39    39.2    4.7      645  17  AAT42476          HuntL4 coding sequ
     40    38.8    4.7     1283  23  AAS85010          DNA encoding novel
     41    38.6    4.7     2853  22  AAI58576          Human polynucleoti
     42    38.6    4.7     2863  22  AAI60362          Human polynucleoti
     43    38.6    4.7     7238  22  AAS36386          Human cardiovascul
     44    38.4    4.6      804  24  ABK48294          Degenerate DNA enc
c    45    38.4    4.6     2224  19  AAV35472          Rat rSK3 coding se



ALIGNMENTS 



RESULT 1 



GenCore version 5.1.3
Copyright (c) 1993 - 2003 Compugen Ltd.

OM nucleic - nucleic search, using sw model

Run on:         January 11, 2003, 10:54:19 ; Search time 52 Seconds
                (without alignments)
                4883.238 Million cell updates/sec

Title:          US-09-602-740-1
Perfect score:  828
Sequence:       1 gaggagcttcgccacatgga......gcagcgccagctctaacaag 828

Scoring table:  IDENTITY_NUC
                Gapop 10.0 , Gapext 1.0

Searched:       441362 seqs, 153338381 residues

Total number of hits satisfying chosen parameters:      882724

Minimum DB seq length: 0
Maximum DB seq length: 2000000000

Post-processing: Minimum Match 0%
                 Maximum Match 100%
                 Listing first 45 summaries

Database :      Issued_Patents_NA:*
                1:  /cgn2_6/ptodata/2/ina/5A_COMB.seq:*
                2:  /cgn2_6/ptodata/2/ina/5B_COMB.seq:*
                3:  /cgn2_6/ptodata/2/ina/6A_COMB.seq:*
                4:  /cgn2_6/ptodata/2/ina/6B_COMB.seq:*
                5:  /cgn2_6/ptodata/2/ina/PCTUS_COMB.seq:*
                6:  /cgn2_6/ptodata/2/ina/backfiles1.seq:*

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 
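
Several figures in the search header appear to be related. The hit count is twice the number of database sequences, and the reported throughput matches 2 x query length x database residues / search time, which would be consistent with both strands being searched. This reading is inferred from the numbers, not stated by the report. In the Python sketch below, the function name and the strands parameter are hypothetical; the input values are copied from the header of this search.

```python
def million_cell_updates_per_sec(query_len, db_residues, seconds, strands=2):
    """Assumed relation: every query base is compared with every database
    residue once per strand; the result is expressed in millions per second."""
    return strands * query_len * db_residues / seconds / 1e6


# Values from the Issued_Patents_NA search header
query_len = 828
db_seqs = 441362
db_residues = 153338381
search_seconds = 52

print(2 * db_seqs)  # compare with the reported hit count, 882724
print(round(million_cell_updates_per_sec(query_len, db_residues, search_seconds), 3))
# compare with the reported 4883.238 Million cell updates/sec
```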

SUMMARIES

                   %
 Result          Query
    No.   Score  Match   Length  DB  ID                  Description
----------------------------------------------------------------------------
c     1   149.6   18.1  4411529      US-09-103-840A-1    Sequence 1, Appli
c     2    73.2    8.8      331      US-09-060-756-308   Sequence 308, App
      3      40    4.8     2830      US-09-221-017B-730  Sequence 730, App
c     4    36.4    4.4      477      US-09-135-994-1     Sequence 1, Appli
c     5    36.2    4.4    10348      US-08-457-273B-41   Sequence 41, Appl
c     6    36.2    4.4    10348      US-08-556-419-13    Sequence 13, Appl
c     7    36.2    4.4    10348      US-09-041-886-14    Sequence 14, Appl
c     8    36.2    4.4    10366      US-08-246-982A-5    Sequence 5, Appli
c     9    36.2    4.4    10366      US-08-453-265-5     Sequence 5, Appli
c    10    35.?    4.3     3819      US-07-686-322A-3    Sequence 3, Appli
c    11    35.6    4.3     3819      US-08-002-999-3     Sequence 3, Appli
c    12      35    4.2      533      5482709-5           Patent No. 5482709



GenCore version 5.1.3
Copyright (c) 1993 - 2003 Compugen Ltd.

OM nucleic - nucleic search, using sw model

Run on:         January 11, 2003, 11:34:33 ; Search time 58 Seconds
                (without alignments)
                6282.855 Million cell updates/sec

Title:          US-09-602-740-1
Perfect score:  828
Sequence:       1 gaggagcttcgccacatgga......gcagcgccagctctaacaag 828

Scoring table:  IDENTITY_NUC
                Gapop 10.0 , Gapext 1.0

Searched:       389086 seqs, 220051671 residues

Total number of hits satisfying chosen parameters:      778172

Minimum DB seq length: 0
Maximum DB seq length: 2000000000

Post-processing: Minimum Match 0%
                 Maximum Match 100%
                 Listing first 45 summaries



Database :      Published_Applications_NA:*
                1:  /cgn2_6/ptodata/2/pubpna/US07_PUBCOMB.seq:*
                2:  /cgn2_6/ptodata/2/pubpna/PCT_NEW_PUB.seq:*
                3:  /cgn2_6/ptodata/2/pubpna/US06_NEW_PUB.seq:*
                4:  /cgn2_6/ptodata/2/pubpna/US06_PUBCOMB.seq:*
                5:  /cgn2_6/ptodata/2/pubpna/US07_NEW_PUB.seq:*
                6:  /cgn2_6/ptodata/2/pubpna/PCTUS_PUBCOMB.seq:*
                7:  /cgn2_6/ptodata/2/pubpna/US08_NEW_PUB.seq:*
                8:  /cgn2_6/ptodata/2/pubpna/US08_PUBCOMB.seq:*
                9:  /cgn2_6/ptodata/2/pubpna/US09_NEW_PUB.seq:*
                10:  /cgn2_6/ptodata/2/pubpna/US09_PUBCOMB.seq:*
                11:  /cgn2_6/ptodata/2/pubpna/US10_NEW_PUB.seq:*
                12:  /cgn2_6/ptodata/2/pubpna/US10_PUBCOMB.seq:*
                13:  /cgn2_6/ptodata/2/pubpna/US60_NEW_PUB.seq:*
                14:  /cgn2_6/ptodata/2/pubpna/US60_PUBCOMB.seq:*



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 

SUMMARIES

                   %
 Result          Query
    No.   Score  Match   Length  DB  ID                  Description
----------------------------------------------------------------------------
      1     705   85.1      705   9  US-09-738-626-1745  Sequence 1745, Ap
      2      81    9.8      957   9  US-09-738-626-1744  Sequence 1744, Ap
      3    54.2    6.5      414  10  US-09-925-299-284   Sequence 284, App
      4      42    5.1     4966  10  US-09-764-869-1885  Sequence 1885, Ap
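
The DB column in the summaries appears to be an index into the numbered database list printed earlier in the same search. As a small illustration, the Python lookup below resolves the two indices that occur in this table. The file names are copied from the Published_Applications_NA list, with the directory prefix and the .seq suffix dropped; the helper name is hypothetical.

```python
# Entries 1..14 of the Published_Applications_NA database list
PUBLISHED_APPLICATIONS_NA = [
    "US07_PUBCOMB", "PCT_NEW_PUB", "US06_NEW_PUB", "US06_PUBCOMB",
    "US07_NEW_PUB", "PCTUS_PUBCOMB", "US08_NEW_PUB", "US08_PUBCOMB",
    "US09_NEW_PUB", "US09_PUBCOMB", "US10_NEW_PUB", "US10_PUBCOMB",
    "US60_NEW_PUB", "US60_PUBCOMB",
]


def database_for(db_index):
    """Map a 1-based DB index from the summaries to its list entry."""
    return PUBLISHED_APPLICATIONS_NA[db_index - 1]


for hit_id, db_index in [("US-09-738-626-1745", 9), ("US-09-925-299-284", 10)]:
    print(hit_id, "->", database_for(db_index))
```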



GenCore version 5.1.3
Copyright (c) 1993 - 2003 Compugen Ltd.

OM nucleic - nucleic search, using sw model

Run on:         January 11, 2003, 09:54:28 ; Search time 1967 Seconds
                (without alignments)
                6817.419 Million cell updates/sec

Title:          US-09-602-740-1
Perfect score:  828
Sequence:       1 gaggagcttcgccacatgga......gcagcgccagctctaacaag 828

Scoring table:  IDENTITY_NUC
                Gapop 10.0 , Gapext 1.0

Searched:       16154066 seqs, 8097743376 residues

Total number of hits satisfying chosen parameters:      32308132

Minimum DB seq length: 0
Maximum DB seq length: 2000000000

Post-processing: Minimum Match 0%
                 Maximum Match 100%
                 Listing first 45 summaries



Database :      EST:*
                1:  em_estba:*
                2:  em_esthum:*
                3:  em_estin:*
                4:  em_estmu:*
                5:  em_estov:*
                6:  em_estpl:*
                7:  em_estro:*
                8:  em_htc:*
                9:  gb_est1:*
                10:  gb_est2:*
                11:  gb_htc:*
                12:  gb_est3:*
                13:  gb_est4:*
                14:  gb_est5:*
                15:  em_estfun:*
                16:  em_estom:*
                17:  gb_gss:*
                18:  em_gss_hum:*
                19:  em_gss_inv:*
                20:  em_gss_pln:*
                21:  em_gss_vrt:*
                22:  em_gss_fun:*
                23:  em_gss_mam:*
                24:  em_gss_mus:*
                25:  em_gss_other:*
                26:  em_gss_pro:*
                27:  em_gss_rod:*



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES

                   %
 Result          Query
    No.   Score  Match   Length  DB  ID                Description
----------------------------------------------------------------------------
      1    72.4    8.7      594  14  BQ250641          BQ250641 TaE25002B
      2    70.8    8.6      476  10  BE415351          BE415351 MWL028.CO
      3    70.8    8.6      490  10  BE443267          BE443267 WHE1110_C
      4    70.8    8.6      533  10  BE443748          BE443748 WHE1122_G
      5    70.8    8.6      568  10  BE444985          BE444985 WHE1130_C
      6    70.8    8.6      578  10  BE443954          BE443954 WHE1123_D
      7    70.8    8.6      636  14  BQ805048          BQ805048 WHE3562_C
      8    70.2    8.5      665  14  BM815572          BM815572 EST593666
      9    68.6    8.3      455  12  BG653158          BG653158 sad82f05.
     10    68.6    8.3      601  14  BQ629730          BQ629730 sap92g05.
     11    67.8    8.2      807  14  BQ804658          BQ804658 WHE3557_C
c    12    67.4    8.1      693  10  AW348678          AW348678 GM210003A
     13    67.4    8.1     1264  10  BE420690          BE420690 HWM001.E0
c    14      67    8.1      713  13  BJ263815          BJ263815 BJ263815
c    15      66    8.0      571  10  BE443356          BE443356 WHE1112_B
     16      66    8.0      648  13  BJ304570          BJ304570 BJ304570